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Abstract 

Firm growth process in the developing economies is known 
to produce divergence in their growth path giving rise to bi- 
modality in the size distribution. Similar bimodality has been 
observed in wealth distribution as well. Here, we introduce 
a modified kinetic exchange model which can reproduce such 
features. In particular, we will show numerically that a non- 
linear retention rate (or savings propensity) causes this bi- 
modality. This model can accommodate binary trading as 
well as the whole system-side trading thus making it more 
suitable to explain the non-standard features of wealth distri- 
bution as well as firm size distribution. 



1 Introduction 



Q^The power-law structure of the firm-size distribution in the 
'developed economies have been investigated thoroughly in the 
econophysics literature [TJ |2j |3j [4] as well as in the economics 
^ [5]. Different classes of models have been proposed that are 
OO able to replicate that feature [2j [6] [8] [9] . For the developed 
economies there are two more well established characteris- 
tics defining firm growth process viz., Laplace distribution 
of growth rate and power law decay in fluctuation with size 
[lOj [TTJ [12j [13]. Here, we focus on another statistical regu- 
larity known for long in the economics literature that in the 
developing economies the size distribution of firms shows bi- 
J* modality where most of the firms are either very large or very 
. small [14] . This particular feature can be dependent on many 
^ factors. Here, we will discuss one possible mechanism based 
H on the literature of the kinetic exchange models. 
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We will introduce a modified kinetic exchange (KE hereafter; 
first proposed in Ref. p~5j[16], for details see [HI [18]) model 
that can replicate this feature in a non-trivial way. The vari- 
able of interest is the firm size which is usually measured in 
terms of its work force. Following Ref. [T9] , we assume that 
the workers change their workplace every year (the turn-over 
rates measuring inflow and outflow of workers are very high in 
the formal sectors). Hence, every firm retains a certain frac- 
tion of workers and fires the rest (or the workers themselves 
decide to leave) giving rise to a permanent cycle of workers 
across the firms. Though, we borrow the tools of the KE 
model, clearly we cannot use a binary exchange mechanism 
here. In particular, we will extend the usual KE model in 
two directions. First, we will use n-ary trading (or collisions, 



where n < N the system size) as has been done in Ref. [19] 
and secondly, we shall consider the retention rate A to be a 
function of the evolving variable, the work- force w (known as 
the savings propensity and wealth respectively in the wealth 
distribution literature) . 

Here, we present some evidence on such divergence in the 
growth process of firms in the developing economies. Fig. [I] 
shows the distribution of all publicly traded firms in USA and 
Japan. Clearly, the data shows unimodality. Here, one must 
be careful because this data-set is limited as it includes only 
the publicly traded firms (Data source: Compustat; identi- 
fier: GVKEY). Since it is not exhaustive, it does not show 
the power-law tail. BLS [20] would be more appropriate for 
that purpose (used in e.g. Ref. pQ). However, since we are 
concerned about the uni-modality rather than the exact dis- 
tribution, this data set suffices. Next, we show that there is a 
significant pattern in development process where the poorer 
a country becomes, the more it loses the middle-sized firms. 
This is known as the 'missing middle' [14]. Here is a gen- 
uine problem as such detailed data-bases are not available for 
the poor economies precisely for which we need data to es- 
tablish bimodality. One major reason for this lack of data is 
that most of the firms are unregistered and hence not counted 
leading to a lower count for micro firms. However, we can use 
the SME database [21] which describes the fraction of em- 
ployees in the micro, small, medium and large firms across 
the economies (see appendix for data description). The re- 
sult is shown in the table below (rows: firm size distribution, 
columns: income groups; see appendix). Clearly, in the less 
developed economies the middle sized firms are missing [13] . 





micro 


small 


medium 


large 


High 


35.6 


15.6 


15.4 


33.4 


Upper middle 


25.8 


13.2 


19.4 


41.7 


Lower middle 


29.1 


10.2 


16.0 


44.7 


Low 


11.2 


7.4 


9.2 


72.2 
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We therefore, arrive at the conclusion that the develop- 
ing economies are characterized by non-fixed, size-dependent 
heterogeneity giving rise to bimodality and the developed 
economies are characterized by fixed heterogeneity giving rise 
to Zipf's law in the firm size distribution. It is noteworthy 
that in the literature, bimodality in the wealth distribution 
is also seen, most prominently in Argentina [22] and in in- 
come distribution [23 . We will also show that our model 
can replicate bimodality even with binary trading mechanism 
thus explaining the bimodal asset distribution as well. It is 
noteworthy that there is evidence for bimodality in the world 
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Figure 1: Prob. density functions of sizes of all publicly 
traded firms (2000-2011) in USA and Japan(inset). Uni- 
modality is evident. Data source: Compust at (identifier: 
GVKEY). Note that this dataset is severely limited due to 
exclusion of publicly untraded firms and hence, it does not 
show the Zipf's law pQ. The log of firm-size has a good fit 
with a normal distribution indicating that the distribution of 
firm-size is log- normal (color online). 



wide income as well (a bunch of poor economies form a mode 
and the rich economies form another; see e.g. [24 j, see also 
[25] ) though our model is not directly relevant there. 

2 The model 

Consider an array of N firms with size Wi {i < N). Fol- 
lowing [10] we adopt the rule that the firm-size is determined 
by the number of their employees. Ref. [19 first applied 
the kinetic exchange model to explain firm-size distribution. 
Here, we modify the retention rate (which corresponds to the 
savings propensity (A) in the usual KE models) as a function 
of the current labor force w. This process captures the more 
realistic scenario that as a firm increases its work-force, the 
more workers it retains (or in the context of wealth distribu- 
tion, a richer person saves more). In general, we suppose that 
the number of firms from which the workers are leaving and 
moving into, is n. At each time point (1 — X(w)) fraction of 
the workforce of those n firms leaves. So there would be a 
total pool of workers that wants to change their workplace. 
Next, this pool of workers is randomly divided into those n 
firms. Hence, the dynamics is given by the following set of 
equations, 

n 

Wl (t + 1) = X( Wl (t)) Wl (t) + ei (t+ i)5^(l - X( Wj (t))) Wj (t) 



Wi (t + 1) = X(w i (t))w i (t)+e iit+1) J2^-Hwj(t)))w j (t) 



n 

w n {t + l) = \{w n (t))w n {t) + e n ( t+ i) - \( Wj (t))) Wj (t) 

such that Yl'j e j(t) — 1 f° r a U t ( see appendix on how to 
generate e). As is evident from above, this is a straight gen- 
eralization of the usual kinetic exchange models (with n = 2) 
that has primarily been used to study the income/wealth dis- 
tribution models (see Ref. [13 [18]). For notational clarity, 
define w = [wi,W2, . • . , w n )' at every time period t and con- 



sider n = N i.e. the workers change their respective jobs 
across the whole population of firms . Let us also denote the 
transition matrix by T\( w y Therefore, the system of equa- 
tions shown above can be compactly presented as 

w(t + l)=T x(w) w(t). (1) 

Let us assume the following functional form of A, 

\(w) = ci(l — exp(— C2w)). (2) 

Simulation shows that the system stabilizes to a certain dis- 
tribution of w* which is a fixed point of Eqn. [l]i.e. w* solves 

™*=T x{w * ) w*. (3) 

Note that with constant c\ £ [0,1], — » oo and binary in- 
teractions, the model is the CC model [16]. However, for 
C2 << oo, the retention rate A becomes a non-linear function 
of w. The analytical solution of this system is not known (even 
at the limit C2 — » oo). Hence, we present numerical results 
only. We considered N = 10 3 and ^ N Wj = N. Given c\ 
and C2, after simulating the system for 0(1000) time periods 
we arrive at a steady state distribution. We take average over 
O(100) of such steady states to arrive at the final distribution. 

2.1 Results 

Fig. [2] shows the steady state distributions for different values 
of C2 with c\— 0.95. Given the distribution of workers, one 
can find out the corresponding distribution of the retention 
rate A as well (see inset of figure [2]). Next, we find out the 
region in the ci, C2 space where bimodality appears. For that 
purpose, we simulate the distribution given a pair of values 
for ci and C2. Then we apply Hartigan's dip test to conclude 
if the distribution is bimodal or not (see Fig. |3|. We present 
the results at three different levels of significance. Note that 
here the choice is binary, either we accept a distribution to be 
bimodal or not. Hence, in Fig. [3] the degree of bimodality is 
not shown. 

Ref. [26 shows the existence of bimodality in the KE models 
by assuming the existence of two separate classes of agents 
with two widely differing savings propensities (A). Here, we 
adapt our model to include that possibility as well. We con- 
sidered the following functional form, 

X(w) = ci + (1 - 2ci) | -— , \ \ / \ i • (4) 

V ; 1 V 1J \l + exp(-(w-(w))/c2)J V ; 

The result is shown in Fig. |2j It is obvious that c\ < 1/2. The 
corresponding parameter space where bimodality appears is 
also shown in the same figure. Note that in the limit C2 —> oo, 
this model coincides with the usual CC model with A =0.5 
[T6] . On the other hand, if C2 — >• 0, then we get the model 
proposed in [26] . 

2.2 Transition from purely exponential to 
Zipf's law 

So far we have considered all the firms to be characterized 
by the same c\ and C2. Hence, they are ex- ante homogeneous 
but ex-post heterogeneous as their evolution depends on their 
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Figure 2: Emergence of bimodality in the basic model with 
Eqn. [2j is shown with the variation of the parameter C2 with 
ci =0.95 (color online). In the inset, we show the correspond- 
ing distribution of the retention rate A. 
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Figure 5: This diagram shows the transition from purely ex- 
ponential to a bimodal distribution with a power law tail due 
to changes increase in C2 from to 1. The inset shows a com- 
plete power law with coefficient -2 for very high values of C2- 
In all cases, c\ is uniformly distributed for all firms, (color 
online). 
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Figure 3: The region in the parameter space(ci,C2) where 
bimodality appears with Eqn. [2] (red squares; color online). 
For all parameter combinations we have simulated the model 
and then applied Hartigan's dip test for the presence of bi- 
modality. Panel (a) shows the area where level of significance 
is 1%. In panels (b) and (c), the corresponding levels are 
5% and 10% respectively indicating higher acceptance rate of 
bimodality. 
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Figure 4: Steady state distributions with the sigmoidal re- 
tention function given by Eqn. [4] Panel (a): This graph 
shows disappearance of bimodality as we vary parameter C2 
(ci = 0.3). Panel (b): The region (red squares) in the param- 
eter space where bimodality appears (color online). 



specific stochastic histories. We can introduce ex- ante hetero- 
geneity as in the CCM model [18] where all agents are char- 
acterized by different c\ which is fixed over time. With such 
modification, this model shows a very prominent transition 
from a purely exponential distribution to a bimodal distribu- 
tion with a power law tail, eventually converging to a simple 
power law. We assume c\ to be distributed uniformly between 
and 1 and fixed for all firms over time. For C2 = 0, the dis- 
tribution is a purely exponential one as expected (since all As 
are zero). For C2 = 1, we have a bimodal distribution in the 
log- log plot with a power law tail (see Fig. [5|. For higher 
values of C2, the distribution converges to a power law with 
coefficient -2. We plotted the data for C2 = 10, 20, 30, 40 and 
50 in the inset of Fig. [5] 

2.3 Applications in wealth distribution 

It has been claimed in Ref. [22 , that wealth distribution 
shows bimodality in certain countries. Here, we consider the 
same model with binary interactions as in the basic kinetic ex- 
change models. All the qualitative results hold true with very 
little quantitative changes (see Fig. [6|. However, with 7V-ary 
interactions, the system reaches steady state much faster than 
with binary trade. 

3 Zipf 's law and Laplace distribution 

Ref. [18 derives power law from fixed heterogeneity (assum- 
ing a distribution of A fixed for all agents over time). The 
mean-field equation boils down to 

\(w\) = constant. (5) 

Assuming /3(l,n) distribution of A, (w) becomes inverse of a 
beta distribution which is known to be a power law. However, 
this model when applied to model firm dynamics [19] does not 
show the Laplace distribution of growth. Here, we show that 
if we consider multiplicative conservation instead of additive 
(as in the usual KE models), then the Zipf's law and the 
Laplace distribution in growth are the natural outcomes even 
without any heterogeneity. Consider the following evolution 
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Figure 6: Outcome of the model with binary trading mecha- 
nism corresponding to the usual kinetic exchange models with 
A defined by Eqn. [2] (color online). Quantitative differences 
are minor with the case considered with 7V-ary trading (Fig. 



of ' 



w(t + l)=T x{w) w(t). 



(6) 



To assume complete homogeneity of the firms, define c\ — 
i.e. X(wj) = V j. However, lets also redefine measure of 
workers mj = exp(wj). Note that this corresponds to an 
multiplicative conservation where Yl m j = C a constant. This 
equation is readily identified as an production possibility sur- 
face of an economy with firms engaged in production sharing 
the same resources available in finite amounts (this is similar 
to a consumer's indifference curve). So with the same amount 
of resources, the economy can move along the hyper-surface. 
We know that 

p(w) = T exp(-w/T) where, T = (w). (7) 

Assume T — 1 and recall that m — log(w). Therefore, 

r log(9) 



p(m < 0) 



p(w)dw 



1-1/0 



(8) 



and by differentiation , we get p(m) oc m~ 2 i.e., we recover 
Zipf's law pp. Note that the growth rate g t +\ = log(m t +i/m t ) 
is the difference of two uncorrelated exponential distributions 
(as defined in [10 ), 



9t+i - w t 



(9) 



and hence, has Laplace (bi-exponential) distribution (proof is 
omitted) |10]. 

4 Summary 

In this paper, we have described a kinetic exchange model 
with a retention rate A (or savings propensity [16]) depen- 
dent on the mass of workers w (or wealth) and we have shown 
numerically that it leads to prominent bimodality in the size 
distribution of firms as has been empirically found in the de- 
veloping economies [14] and in the wealth distribution [22] . 
To apply our model to firm size distribution, we consider an 
7V-ary interaction (see Sec. [2| whereas to apply it to model 
wealth distribution, we consider binary interactions with little 



quantitative change in the results (see Sec. 2.3). The whole 
system is described in Eqn. [I] and [2] As is evident all agents 
are characterized by the same parameters describing A viz., c\ 
and C2 . The emergence of bimodality as the system organizes 
itself, is evident in Fig. [2] We also show the area in the pa- 
rameter space where bimodality appears in Fig. [3] To accept 
or reject bimodality we apply Hartigan's dip test [27 . Ref. 
[26] considered the possibility of generating bimodality in the 
basic kinetic exchange model by imposing two widely sepa- 
rated value of A. We accommodate that possibility as well by 
assuming a sigmoidal form of A in Eqn. [4] The results are 
summarized in Fig. [4] as well as the corresponding parameter 
space where bimodality appears. 

Thus in all these cases we assume ex-ante identical agents 
(that is with identical parametric description) but the agents 
are ex-post heterogeneous as their evolution depends on their 
own stochastic paths followed in history. Next, we introduce 
ex-ante heterogeneity on the agents as in the CCM model [T8] . 
We show how the distribution shows a transition from a purely 
exponential to bimodality with a power law tail which eventu- 
ally show a full power law tail with changes in the parameters 
describing the system. We assumed the same functional form 
for A as in Eqn. [2] but Ci is uniformly distributed (within 
and 1) over the agents as in the CCM model where c\ dif- 
fers across agents but are fixed for all agents over time (see 
Sec. 2.2). In Fig. [5j we show the distributions correspond- 
ing to different values of c^. For = 0, the distribution is 
exponential and as we increase the distribution shows bi- 
modality with the right mode having a power law tail. As we 
increase C2 even further, the distribution shows a full power 
law as in the CCM model. In Sec 12.31 we discuss the case 
with binary trading and numerically found that the results 
are almost identical. Lastly, we consider a completely homo- 
geneous agent model and showed that the Zipf's law [T] and 
the Laplace distribution [10] of fluctuation can be easily ac- 
commodated in a slightly different version of the same model. 

As has been argued in the literature, bimodality is only a 
transitional feature seen in the developing economies [14] or 
in times of economic instability [22] whereas unimodality is a 
sign of stable economy [15] . We see that ex-post heterogeneity 
(agents are ex-ante identical but because of the dependence 
of A on w they are ex-post heterogeneous) induces bimodality. 
However, as the economy stabilizes the heterogeneity becomes 
ex- ante as in the CCM model [18] giving rise to a power law 
distribution. 



5 Appendix 
5.1 On data 

We have used the data from Ref. [21] which declares that 
this data set has secondary data and hence, since sources 
and definitions differ across countries, there may be errors 
and there are discrepancies. Therefore, we have removed all 
the countries with incomplete data and then those countries 
with incomparable definitions of micro, small and medium 
firms. We followed the rule that micro firm has less than 
10 employees, small and medium have <40 or 50 and <250 
or 300 respectively. The rest are large firms. We ended up 
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with 22 high income, 3 upper middle, 13 lower middle and 3 
low income countries and took average over them to arrive at 
table [TJ We note that micro firms also have a low mass in 
the low-income group which is probably due to exclusion of a 
large number of unregistered micro firms. 

5.2 Simulating e 

For two agents, e\ is uniformly distributed between zero one 
and we define 62 = 1 — e\ . For the general case, note that 
the problem of generating the vector (ei, 62, . . . , ejv) such that 
€j = 1 is basically the problem of sampling uniformly 
from an TV — 1 dimensional simplex. We follow the algorithm 
described below. 

1. Generate a vector of N variables distributed uniformly 
within and 1. 

2. Take log of them and multiply with -1. 

3. Normalize each of the resulting variables by the sum of 
them so that their sum is 1. 
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